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METHOD AND APPARATUS FOR PUBLISHING DOCUMENTS OVER A 
NETWORK 

BACKGROUND OF THE INVENTION 

Field of the Invention 

5 The invention relates to publishing electronic documents over a network, 

such as a wide area network, the World Wide Web, etc.; more specifically, the 
invention relates to controlling access to documents accessible over a network. 

Description of Related Art 

Publishing electronic documents on a network such as a wide area network is 
10 known in the art. For example, see U.S. Patent No. 5,870,552 issued to Dozier. These 
systems are usually client-server systems in which clients make requests to the 
server, and the server generates responses. Client-server platforms are used for 
U authoring of dociiments, content-based indexing of documents, retrieval of 

•t, z 

^^S documents, management and control of dociraients, and support for developing 

15 form-driven interactive services. 

In typical client-server document systems, a doctiment is stored on a server 
from which a user accesses, edits, and/ or restores the document. While providing 
users with an opportunity to perform various tasks on a document stored on a 
server, such systems and the access to documents thereon lack important security 
20 measures. For example, a document stored on a server without some security 
mechanism (e.g., a security key) does not prevent unscrupulous users from 
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accessing the docximent. Moreover, because documents may reside on a server for 
an indefinite period of time, there is an increased risk that an tinauthorized user may 
gain access to a document. Currently, there is no system that ensures that only 
authorized users gain access to a document published on a network and that 
prevents documents from being maintained for long periods of time on a server. It 
is, therefore, desirable to have a system that uses security measures to prevent 
unauthorized individuals from gaining access to documents residing on a server 
and that reduces the risk of imauthorized access. 
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SUMMARY OF THE INVENTION 

A system for publishing an electronic document on a network is described. 
In one embodiment, the system comprises a document source, a filter, and a server. 
The document source supplies an electronic document. The document source may 
be a multifunction machine. The filter transforms the electronic document for 
publication. The server has a memory for storing the electronic document and 
permits access to the electronic document using a document identifier. The server 
also sends, to a publisher designated location, information regarding accesses to the 
electronic document. 
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BRIEF DESCRIFnON OF THE DRAWINGS 



The features, aspects, and advantages of the invention will become more 
thoroughly apparent from the following detailed description, appended claims, and 
accompanying drawings in which: 

Figure 1 is a block diagram of one embodiment of a document publishing 
system; 

Figure 2 is a block diagram of one embodiment of a high-level architecture 
for a typical client-server network; 

Figure 3 shows one embodiment of a computer program with various 
modules that may be executed by one or more processing devices; and 

Figure 4 is a flow chart of one embodiment of a process for providing access 
to an electronic document. 
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DETAILED DESCRIPTION OF THE INVENTION 



In the following description, nximerous specific details are set forth to provide 
a thorough understanding of the invention. However, it will be understood by one 
of ordinary skill in the art that the invention may be practiced without these specific 
details. In other instances, well known structures and techniques have not been 
shown in detail in order not to obscure the invention. 

Some portions of the detailed descriptions which follow are presented in 
terms of algorithms and symbolic representations of operations on data bits within a 
computer memory. These algorithmic descriptions and representations are the 
means used by those skilled in the data processing arts to most effectively convey 
the substance of their work to others skilled in the art. An algorithm is here, and 
generally, conceived to be a self-consistent sequence of steps leading to a desired 
result. The steps are those requiring physical manipulations of physical quantities. 
Usually, though not necessarily, these quantities take the form of electrical or 
magnetic signals capable of being stored, transferred, combined, compared, and 
otherwise manipulated. It has proven convenient at times, principally for reasons of 
common usage, to refer to these signals as bits, values, elements, symbols, 
characters, terms, numbers, or the like. 

It should be borne in mind, however, that all of these and similar terms are to 
be associated with the appropriate physical quantities and are merely convenient 
labels applied to these quantities. Unless specifically stated otherwise as apparent 
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from the following discussion, it is appreciated that throughout the description, 
discussions utilizing terms such as "processing" or "computing" or "calculating" or 
"determining" or "displaying" or the like, refer to the action and processes of a 
computer system, or similar electronic computing device, that maiupulates and 
5 transforms data represented as physical (electronic) quantities within the computer 
system's registers and memories into other data similarly represented as physical 
quantities within the computer system memories or registers or other such 
information storage, transmission or display devices. 

The present invention also relates to apparatus for performing the operations 

r J 

10 herein. This apparatus may be specially constructed for the required purposes, or it 
'fQ may comprise a general purpose computer selectively activated or reconfigured by a 

computer program stored in the computer. Such a computer program may be 
stored in a computer readable storage medium, such as, but is not limited to, any 
type of disk including floppy disks, optical disks, CD-ROMs, and magnetic-optical 

-a 

in 15 disks, read-oiUy memories (ROMs), random access memories (RAMs), EPROMs, 

-Q 

EEPROMs, magnetic or optical cards, or any type of media suitable for storing 
electronic instructions, and each coupled to a computer system bus. 

The algorithms and displays presented herein are not inherently related to 
any particular computer or other apparatus. Various general purpose systems may 
20 be used with programs in accordance with the teachings herein, or it may prove 

convenient to construct more specialized apparatus to perform the required method 
steps. The required structure for a variety of these systems will appear from the 
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description below. In addition, the present invention is not described with reference 
to any particular programming language. It will be appreciated that a variety of 
programming languages may be used to implement the teachings of the invention as 
described herein. 



A publishing system is described that has a source of documents, a filter to 
filter the documents so that only selected documents are published, and a web 
server to allow access to selected documents. In one embodiment, the filter 
transforms the appearance of the document and notifies others that publication has 
taken place, e.g., by email. In one embodiment, the filter comprises a computer that 
stores and sends an electronic document to a server to "publish" the documents. 
The document is accessed using a document identifier (e.g., a URL). A user name 
and a password or other security key may also be generated to access the electronic 
document. The URL and any user name and password are sent to an authorized 
user (viewer), thereby enabling the authorized user to access the document. The 
user name and password or security key may be encoded in the URL itself. Users 
can optionally publicize a document by emailing the URL to a World Wide Web 
indexing service (e.g., Yahoo, Altavista, Excite, or an existing publicity service). 

In one embodiment, the server receiving the document automatically deletes 
it once a predetermined condition has been met. The condition may relate to the 
nimiber of times a document has been accessed, (e.g., after the document has been 
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accessed once, a certain number of accesses), amount of time (e.g., 10 days) from the 
date of first access or publication, or after the document has been successfully 
downloaded by a user from a specified internet address (IP address), etc. The 
condition may be established by the publisher of the document or a user. 

5 Figure 1 is a block diagram of one embodiment of a publishing system. The 

publishing system may comprise a multimedia World Wide Web publishing system 
where the documents being published are multimedia docximents. For example, 
these documents may include scanned images, sound and /or video clips. 

£3 

;j Referring to Figure 1, document source 5 generates documents. In one 

^9 10 embodiment, document source 5 may be capable of performing multiple functions 

Co 

ft I 

p?? such as, for example, copying, faxing, e-mailing, printing, or scanning an electronic 

= document. The documents may be generated and published as part of performing 

C3 

p that fimction(s). For example, if document source 5 comprises a copier, the 

S.i 

^ document that is generated for potential publication is the result of performing the 

15 copying operation. Saving a copy of a document as a side effect of another process 
(e.g., copying, printing, faxing, etc.) in the absence of an explicit command by a user 
to dp so is referred to herein as "unconscious" capture. 

In one embodiment, document source 5 comprises a multifunction machine, 
workstation, or scanner. Document source 5 may have a specifically designed 
20 touchscreen interface that controls the publication process. Options available on the 
interface may include (1) specifying destination web server; (2) specifying level of 
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security; (3) specifying publicity; (4) specifying email destinations for published 
URL (document identifier) and password; (5) specifying local output of a document, 
including printing, faxing, and emailing. Document source 5 may also include 
facilities for composition of multimedia documents such as audio and video capture 
5 devices. 

Document source 5 may comprise a printer driver. In such a case, users print 
to a "webpublish" printer. The document, however, is actually not printed, i.e., 
paper would not actually be generated, but the postscript or other printer language 
£3 for the docviment would be transferred to filter computer 10. Similarly, in one 

*. n 

^ 10 embodiment, users email the document as an attachment to the "webpublish" 
address. 
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The generated electronic documents are sent to and received by a filter 
^ computer 10. Filter computer 10 comprises a document transformation filter lOA to 



. Ft 



^ g transform docimients into a format suitable for publication. For example, in one 

i 

15 embodiment, filter computer 10 may convert Postscript documents into HTML. 

That is, in one embodiment, software executing on filter computer 10 transforms the 
documents from document source 5. 

In one embodiment, document transformation filter lOA ensures that the 
document meets certain standards established by a publisher such as the electronic 
20 document is free of pornographic or violent material. Additionally, filter computer 
10 may prevent documents designated as ''confidential" from being published 
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unless conditions specified by the publisher are met. Filter computer 10 may also 
filter other information that a publisher wishes to prevent from being published 
outside of a firewall. Thus, filter computer 10 filters a document of elements that a 
publisher specifically requests not to be placed on server 15, such as, for example, 
5 material deemed to be pornographic, violent, or information designated as 

proprietary by the publisher. Alternatively, a docimient may be filtered at a work 
station used to compose the electronic document. Also filter computer 10 may 
screen for and remove viruses, especially so called "macro" viruses from, for 
example, Microsoft Word or PowerPoint documents. Techniques for virus removal 
^ 10 are well-known in the art. 
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Filter computer 10 may also have a database lOB for storing and organizing 
multiple documents. In one embodiment, filter computer 10 is capable of hosting 
multimedia data used in composing the document. Filter computer 10 may be 
embedded either in a multi-functional machine or a single function device, such as, 



^ 15 for example, a copier, a facsimile, a printer, a scanner, a smaller device, etc. 

Filter computer 10 sends the document(s) to server 15, which may be outside 
a firewall, for publication. In one embodiment, the connection between filter 
computer 10 and server 15 may be implemented with an Internet network 
connection. In another embodiment, this connection may be by a dialup bulletin 
20 board server. 
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After the electronic document is stored on server 15, a user using an interface 
on a client (not shown) is able to gain access to the electronic document. In one 
embodiment, the user accesses the document with a document identifier (e.g., a 
URL). In an alternate embodiment, the URL and an associated user name and 
password, or security key, are required to access the document. After the user 
inputs the URL and any necessary user name and password, the user is able to 
access the electronic docimient. A user may be notified of the document identifier 
and/ or the password using an e-mail system. Server 15 may also enforce different 
levels of security. 

In one embodiment, filter computer 10 returns the URL for the published 
document to the original publisher, perhaps by email. Optionally, users of the 
document source 5 could indicate a password for the document on the print 
command line. 

Server 15 may be located on a private network. In an alternate embodiment, 
server 15 may be a dialup bulletin board server. Server 15 may be structured as a 
collection of individual documents or it could be set up as a searchable database of 
documents. In the latter case, users may be provided with their own home pages 
that are portals to each user's "collection of documents." A publisher will have 
access to a list of allowable users at the time at which a document is to be published. 

A software system manager for server 15 may be included. The software 
system manager may run in the background as a Windows Service or Unix daemon. 
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The system manager monitors the access logs and updates a database of information 
about who or what systems access which documents at what times. This 
information would be made available to publishers so they could monitor when 
their documents are accessed. Thus, server 15 records the number of times a 
5 document is accessed, the time of each document access, and /or who obtained 
access to a document. Server 15 may also notify a publisher that the document has 
been accessed and specify who accessed it. 

Other common system management functions that may be performed by the 
□ system manager would include deleting documents. After the electronic document 

^ 10 has been accessed by a user a predetermined niomber of times (e.g., 1, 2, etc.), the 

system manager server 15 may automatically delete the doctmient depending upon 



m 



i a the instructions provided by the publisher or by a user (depending on the 
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implementation) to server 15. The deletion of the document may be based on a 
specified condition {e.g., the document is deleted after a certain period of time, after 
15 all of the authorized users have accessed the document, etc.). For example, at the 
time documents are published, users would have the option of setting a deletion 
date. When that date arrives, the system manager would automatically delete the 
indicated document. This would support the transient use of published documents. 

When a document is published, such as when the document is sent to a user's 
20 collection of documents, notification may be automatically sent to that user 

indicating that a document has been sent to his collection of documents. Automatic 
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notification may be based upon an event designated by the publisher (or user), such 
as a time period, a condition precedent, a condition concurrent or a condition 
subsequent. The notification may occur through e-mail by an e-mail based 
document manager which is part of server 15. The e-mail based manager may have 
sorting, sending, and receiving capabilities. 

In order to receive e-mail notification, users may need a computer executing a 
program that allows for monitoring and detecting of incoming e-mail. In one 
embodiment, the software program executing on the user's computer detects receipt 
of a message notifying the user that he has received e-mail. In one embodiment, the 
user's computer may execute software that performs an automatic updating process 
that copies the published document to the user's computer. Such software may be 
triggered by receipt of the notification message. In one embodiment, the software 
executing on the computer also sends a command back to web server 15 to remove 
the published document. 

Server 15 may also include an email-based document manager to allow 
publishers to send an email message to server 15 to obtain a listing of published 
documents, to add or delete passwords or to delete the docimient. 

Figure 2 shows client 100 and server 200 with a firewall (optional) between 
client 100 and server 200. Client 100 has a central processing unit 110, a user input- 
output device or interface 120, at least one storage unit 130, an operating system 140, 
and client software 150. 
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Server 200 contains a central processing unit ("CPU") 210, a user input-output 
220, a storage unit 230, an operating system 235, and computer programs for 
execution thereon. In one embodiment, storage 230 maintains documents and access 
logs. Access logs record iiiformation such as the time a docxmient is accessed, the 
number of times a docimients has been accessed, and by whom. 

The server may include a library of software modules that allow for 
progressive display of large document images as well as a collection of plug-ins that 
can be used to display the multimedia data in published documents. Figure 3 
illustrates modules that may be used in a computer program for the server 15 for 
easy access and use of the documents stored thereon. Compression/ decompression 
module 300 is used for compressing or decompressing a large quantity of document 
data or multiple documents stored on server 15. Additionally, progressive 
transmission and display module 310 may be used for displaying a reduced 
resolution version of a document image while the system loads a higher resolution 
version of the image (e.g., one page of the document is shown at one time and then 
the next page is transmitted). Encryption/ decr5^tion module 320 adds additional 
security measures to documents, such as, for example, encrypting or decrypting a 
document, including public key services. Browser module 330 generates and 
operates a browser, including a customized browser 330 that may be placed on the 
server for access by the user. 
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Moreover, a customized display utilities module 340 allows a user to 
customize the type of display for certain types of documents on the server. For 
example, some users may require very complex display utilities based on the type of 
services required by the user. The user may, for example, be a graphic artist who 
needs significant color, sound, and sight capabilities in order to construct a complex 
media document. This module allows that user to set the type of display parameters 
that he requires. Modules could be customized by the publisher 350 to allow a 
publisher to provide customized utilities for his users or for the publisher himself. 
This may include such items as performing complex mathematical calculations in a 
docximent. System manager module 360 has modules for performing various 
functions such as maintairung the access logs in which information is recorded such 
as how many times a document has been accessed, when a document was accessed, 
and who accessed the document. Additionally, the server manager module 360 may 
notify a publisher that a document has been accessed or it may notify the user that 
an e-mail has been sent and its location on that server. 

Figure 4 is a flow chart of the operation of the publishing system. Referring 
to Figure 4, the process begins by generating an electronic document, including 
transforming a document into an electronic format (e.g., HTML) if necessary 
(processing block 401). The document is then stored on a server (processing block 
402). After storing the document (or at the same time), a URL and a password are 
generated to enable access to the docimient (processing block 403). The URL can be 
generated at the source automatically by, e.g., maintaining a counter that is 
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incremented for each document published at that source. Such a URL might look 
like http: / / www.ricoh.com/45.html for the 45th published document. The URL 
could also be generated manually at the source by allowing the user to enter a name 
for the document. In addition to the automatic URL generation method discussed 
5 above, URLs can be generated automatically on the server by generating a long 

random number. This has the advantage of providing some level of security. It will 
be difficult for anyone trying to break into the system to guess such a randomly 
generated URL. 

The password can be generated by the user or publisher at the source. It can 
10 also be generated automatically by the source or filter computer. Such generation 
may be random generation. 

In an alternative embodiment, the resource locator (e.g., URL) and password 
may be generated prior to storing the document on the server. 



if% 

The URL and password are then forwarded to one or more users (processing 
15 block 404). In one embodiment, these are forwarded via email. These may be 
forwarded separately and may be forwarded using offset communication 
mechanisms. 

The URL and (optionally) the password can be emailed by the publisher user 
to intended recipients. If done from the console of a multifunction machine, the user 
20 interface would include functionality for entering the email addresses or fax phone 
number of intended recipients. The email or faxing could be performed by the filter 
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computer or the server. In an alternate embodiment, after the document is stored on 
the server, the URL and password are returned to the source (e.g., multifunction 
machine). It then performs the email or faxing of the URL. This has the advantage 
of supporting remote publishing with locally controlled publicity. The server might 
5 be a public system in which the publisher user does not want to share his list of 
recipients. It might be a confidential customer list, for example. 

Once the URL and password have been obtained, the user accesses the 
document (processing block 405). The server tests whether a condition has been met 
C3 (e.g., all authorized users have accessed the document, etc.) (processing block 406). 

^ 10 If so, then the server deletes the document (processing block 407) and the processor 



n 5 



ends. 

In the preceding detailed description, the invention is described with 
reference to specific embodiments thereof. It will, however, be evident that various 
modifications and changes may be made thereto without departing from the 
15 broader spirit and scope of the invention as set forth in the claims. The specification 
and drawings are, accordingly, to be regarded in an illustrative rather than a 
restrictive sense. 
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